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Abstract 

Crowdsourcing of jobs to online freelance markets is rapidly gaining popularity. Most crowdsourcing platforms 
are uncontrolled and offer freedom to customers and freelancers to choose each other. This works well for unskilled 
jobs (e.g., image classification) with no specific quality requirement since freelancers are functionally identical. For 
skilled jobs (e.g., software development) with specific quality requirements, however, this does not ensure that the 
maximum number of job requests is satisfied. In this work we determine the capacity of freelance markets, in terms 
of maximum satisfied job requests, and propose centralized schemes that achieve capacity. To ensure decentralized 
operation and freedom of choice for customers and freelancers, we propose simple schemes compatible with the 
operation of current crowdsourcing platforms that approximately achieve capacity. Further, for settings where the 
number of job requests exceeds capacity, we propose a scheme that is agnostic of that information, but is optimal 
and fair in declining jobs without wait. 


Index Terms 

freelance markets, capacity, queuing theory, decentralized algorithms 


I. Introduction 


Methods and structures for information processing have been changing. Enabled by the proliferation of modern 
communication technologies, globalization and specialization of workforces has led to the emergence of new 
decentralized models of informational work. Moreover, the millennial generation now entering the workforce often 
favors project-based or job-based work, as in crowdsourcing and social production j2], |3i], rather than long-term 
commitments |4jj. Indeed over the last decade, more than 100 ‘human clouds’ have launched with a variety of 
structures. These platforms serve clients by harnessing external crowds, and global enterprises similarly harness 
their internal crowds 0-0’ making use of human cognitive surplus for information processing |8j. 

Platforms follow different collective intelligence models |9|, 1101, and require different strategies for allocating 
informational work to workers. In crowdsourcing contest platforms like InnoCentive and TopCoder, there is self- 
selection: work is issued as an open call and anyone can participate in any job; the best submission wins the reward 
GD-G3- In microtask crowdsourcing platforms like Amazon Mechanical Turk, any worker is assumed able to 
do any job and so first-come-first-serve strategies are often used; level of reliability may be considered in optimal 
allocation [15). In freelance markets like oDesk and Elance, however, specialized jobs must be performed by skilled 
workers: allocation requires careful selection from the large pool of variedly-skilled freelancers. 

Freelance market platforms serve as spot markets for labor by matching skills to tasks, often performing on- 
demand matching at unprecedented scales. For example, oDesk had 2.5 million workers and nearly 0.5 million clients 


in 2013 [10]. Herein we study allocation and scheduling of informational work within these kinds of platforms, via 
a queuing framework. We aim to establish fundamental limits through a notion of work capacity, and also develop 
decentralized algorithms, which are easily-computed, that nearly achieve these performance limits. 

Freelancers may have one or more skills (that are known, cf. [16)) and jobs may have multiple parts, called tasks, 
that require separate skills. Due to job skill requirement variety and limited freelancer ability, it is often not possible 
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to find a freelancer that meets all requirements for a job: a job may have to be divided among freelancers. Moreover, 
a task in a job may require so much time that even the task may have to be divided among multiple freelancers. 
There are reputation systems within freelance market platforms, so freelancers have a reputation level as well as 
minimum acceptable hourly rate and skills, which allow worker categorization. Some freelancers are adaptable in 
terms of hours available to spend on a particular type of task, whereas others pre-specify hours available for each 
kind of task. Here we consider the non-adaptable setting where, for example, a freelancer may be available for 20 
hours (per week) of any C++ or Java programming, or may be available for 10 hours (per week) of C++ and 10 hours 
of Java. Studying limits for adaptable freelancers and designing centralized schemes (and their approximations) are 
similar, but the distributed schemes require a different approach. 

The objective of the platform is to find a good allocation of jobs (and tasks) to freelancers. Since working on a 
task requires synchronization among freelancers, work can only start when the whole task has been allocated. On 
the other hand, for some jobs there are interdependencies between different tasks 1171 and hence, for these jobs all 
tasks must be allocated before the job starts. Moreover, some jobs may require all parts to be done by freelancers 
with the same level of expertise for uniform quality and money spent. These considerations lead to concepts of 
decomposability and flexibility that are central to our development. 

In the ethos of self-selection, it is desirable for crowd systems to not be centrally controlled, but rather for jobs 
and freelancers to choose each other. Currently, this may happen randomly or greedily. This is clearly not optimal, 
as the following example illustrates. Consider two types of jobs (single task) and two categories of freelancers. A 
type 1 job can be served by either of the worker categories (example, lower reputation requirement) whereas type 2 
jobs can only be served by category 2 workers. If freelancers and jobs are allocated arbitrarily then it may happen 
that type 1 uses many category 2 freelancers and many type 2 jobs remain unserved. 

Optimal centralized allocation of informational tasks under the constraints of crowd systems is related to hard 
combinatorial problems such as the knapsack problem. Compared to scheduling problems in computer science 1181, 
communication networks 119 1 , |20|, and operations research |[2T], crowd systems face challenges of freedom of 
self-selection, need for decentralized operation, and uncertainty in resource availability. 

Prior works in the information theory, networking, and queueing literatures are similar to our work in terms of 
theoretical framework, performance metrics, and the nature of performance guarantees, but are not directly related. 
The notion of capacity of a resource-shared system where jobs are queued until they arc served and the notion of a 
capacity-achieving resource allocation scheme for this kind of system came to prominence with the work of Tassiulas 
and Ephremides [221, [23]. The capacity concept and capacity-achieving schemes were subsequently developed for 
applications in communication networks [ 191, 1201, [241, [25;], cloud computing [[261, online advertising [j27j, |28| , 
and power grids [29], among others. With the advent of cloud services, large-scale systems have attracted significant 
research interest: resource allocation schemes and their performance (queueing delays, backlogs, etc.) in the large- 
scale regimes have been studied [30]-|33[. 

In this paper, our goal is to understand the fundamental limits (capacity) of freelance markets and ways to 
achieve this ultimate capacity. We first develop a centralized scheme for achieving these maximum allocations 
where a central controller makes all job allocation decisions. Given the potential large scale of platforms, we also 
discuss low-complexity approximations of the centralized scheme that almost achieve the limit. Finally, with an eye 
towards giving flexibility to customers (job requesters) in choosing freelancers, we propose simple decentralized 
schemes with minimal central computation that have provable performance guarantees. Further, since job arrival 
and freelancer availability processes are random (and sometimes non-stationary), we also address ways to adapt 
when the system is operating outside its capacity limits. 


II. System Model 

We first provide formal definitions of the nature of informational work and workers, and establish notation. 

Freelancers (or agents) are of L categories. In each category l £ [L], there are M l types of agents depending 
on their skill sets and available hours. There are S skills among agents of all categories and types. An agent of 
category l and type i has a skill-hour vector h\, i.e. h\ s available hours for work involving skill s e[S\. 

Jobs posted on the platform are of N types. Each type of job j £ [N] needs a skill-hour service ry, i.e. ry. s 
hours of skill s. A part of a job of type j involving skill s is called a (j, s)-task if r h , s > 0, which is the size of 
this task. 
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A job of type j can only be served by agents of categories l G A f(j) C [L\. This restriction is captured by a 
bipartite graph G = ([A], [L\, E ), where a tuple (j. 1) f E C [A"] x [L] implies that category l agents cannot serve 
jobs of type j. 

On the platform, jobs are allocated at regular time intervals to available agents, these epochs are denoted by 
t G {1,2,...}. Jobs that arrive after epoch t has started are considered for allocation in epoch t + 1, based on 
agents available at that epoch. Unallocated jobs (due to insufficient number of skilled agents) are considered again 
in the next epoch. 

Jobs arrive according to a Z^-valued stochastic process A(t) = (Ai(f), ^(t),... ,Aj\r(t)), where Aj(t) is the 
number of jobs of type j that arrive in scheduling epoch t. 

The stochastic process of available agents at epoch t is U(t) = (U 1 (f), U 2 (f),..., U L (f)). For each agent 
category l, V(t) = ([/{(f), [/{(f),..., U l M ft )) denotes the number of available agents of different types at epoch 
t. 

We assume processes A(f) and U(f) are independent of each other and that each of these processes is independent 
and identically distributed for each f Wc also assume that each of these processes has a bounded (Frobenius norm) 
covariance matrix. Let T(-) be the distribution of U(f), and let A = E[A(f)] and ^i l = E[U ; (i)] for l G [L\ be the 
means of the processes. 

At any epoch t, only an integral allocation of a task (say (j, s )) is possible. A set of tasks t±, t 2 , ■ ■ ■, t n of size 
ri, r 2 ,..., r n of skill s can be allocated to agents 1,2,..., rn only if available skill-hours for skill s of these agents 
li \. h 2 ,..., hyyi satisfy 

n m 

^ Vip < hi, ^ v q j > rj,j G [n], i G [to] 
p = i <?=t 

for some { v pq > 0}. 

Whether different tasks of a job can be allocated at different epochs and across different categories of agents 
depend on the type of the job. 

Definition 1. A type of job j is called non-decomposable (decomposable) if different tasks comprising it are (are 
not) constrained to be allocated at the same epoch. 

Definition 2. A type of job j is called inflexible (flexible) if different tasks as well parts of tasks comprising it are 
(are not) constrained to be allocated to the same category of agents. 

In a system with only decomposable jobs, given a set of {u ; = (u[,u l 2 , ■ ■ ■ ,u l Mt ) : l G [ L ]} agents (that is, u\ 
agents of category l and of type i within that category), a number ay s of (j, ,s)-tasks can be allocated only if there 
exist non-negative {z l j s : ( l,j,s ) G [L\ x [N] x [S']} satisfying 

4s = a ;M> z j, s = 0 if (i- 0 i E i for a11 j e [N], s g [S], 

i 

z j,s r hs < J2 u i h[ s , for alU G [L],s G [S]. (1) 

j G [N] 

On the other hand, given a set of {u^ = (u\. u l 2 , ■ • •, u l Ml ) : l G [T]} agents in a system with only non- 
decomposable jobs, aj jobs of type j (for each j ) can be allocated only if there exist non-negative { z 1 - s : (/, j) G 
[L\ x [N]} satisfying 

0 and aj tS = (ij for all j , s. (2) 

Intuitively, the conditions imply that required skill-hours for the set of jobs is less than the available skill-hours 
of agents. The {z 1 - s } capture a possible way of dividing tasks across multiple category of agents, as they can be 
interpreted as the number (possibly fraction) of (j, s)-tasks allocated to /-category agents. Note that conditions ([T} 
and Q are necessary for allocations of decomposable and non-decomposable jobs respectively. These conditions 
only imply that there exist possible ways of splitting jobs and tasks across different categories of agents to ensure 
integral number of tasks (jobs) are allocated in case of decomposable (non-decomposable) jobs. 

'Most of our results can be extended to stationary ergodic processes. 
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For a system with only flexible jobs, different parts of a task can be allocated to different categories and a category 
can be allocated parts of tasks. Hence, {zj s } can possibly take any value in . Thus flexible and decomposable 
(non-decomposable) jobs need to satisfy condition ([T]) (condition Q) which we refer to as FD (FND). 

For inflexible jobs, a necessary condition the allocation must satisfy is that each category gets the same integral 
number of (j, s)-tasks for all s,j, i.e., 


4® £ s.t. z l j s = z' j s , for all s,s',j. 


(3) 


An allocation of inflexible and decomposable (non-decomposable) jobs needs to satisfy conditions 0 (condition 
([2])) and ([3]), which we refer to as ID (IND). 

For simplicity, in this work we focus on systems with only a single one of these four classes of jobs^For brevity 
we use the same abbreviations to refer to class of job, as we use for the necessary conditions. Thus, we have FD, 
FND, ID, and IND systems. 

In crowd systems, the scaling of number of job and agent types, rate of job arrivals, and number of available 
agents is as follows: X(N) = -\j scales faster than N, i.e. X(N) = oj(N) or lirrry^oc N/X(N) = 0 and the 

number of skills S scale slower than N, i.e. S = o(N). In practice, a job requires at most a constant number of 
skills d, implying there are f l(S d ) possible job types. On the other hand, the number of skills of an agent d 1 < d as 
a job generally requires more diversity than a single agent possesses, implying M = M l = 0(S d ). L = 0(1), 
as it relates to variation in reputation levels and hourly rates, and so M = o(N). Beyond these system scalings 
seen in practice, we assume A j(N) = cu(l),Vj E [ N] and Ylj-r- >o A-,(iV) = D (N c ) for all s E [S ], for some 
c > 0. In the sequel, we assume these scaling patterns and refer to them as crowd-scaling. 


III. Capacity, Outer Region, and Centralized Allocation 

In this section we study the limits of a freelance market with centralized allocation and present a centralized 
algorithm that achieves the limit. We also discuss a simpler upper bound for the capacity region in terms of first-order 
statistics of the system. These results on ultimate system limits and ways to achieve them are not only important in 
their own right, but also serve as benchmarks for later discussion of decentralized schemes that provably achieve 
nearly the same limits. 

To formally characterize the maximal supportable arrival rate of jobs we introduce some more notation. For each 
j E [N], let Qj(t) be the number of unallocated jobs that are in the crowd system just after allocation epoch t — 1. 
As defined above, Aj(t) is the number of jobs of type j that arrive between starts of epochs t—1 and t. Let Dj(t) 
be the number of jobs of type j that have been allocated to agents at epoch t; we call a job allocated only when 
all parts have been allocated. Thus the evolution of the process Qj(t) can be written as: 

Qj(t + 1) = Qj(t ) + Aj(t) — Dj{t). (4) 

Note that at any epoch t, at most Qj(t) + Aj(t) type j jobs can be allocated, as this is the total number of type j 
jobs at that time and hence Dj(t) < Qj(t) + Aj(t), implying Qj(t) > 0. 

Notation and Convention. We denote the interior and the closure of a set C by C and C, respectively. When 
we say A = (Ai, A 2 ,..., Ayr) E A C M.+ s we mean X s = ((Ai, Ai,..., S times ), (A 2 , A 2 ,..., S times ),...) E A. 
Also, whenever we say A C (D)A 7 for A' C R+, we mean for any X E R+, A E A' <^= (=>)A S E A. 

Definition 3. An arrival rate A is stabilizable if there is a job allocation policy V under which Q (t) = ( Qj(t),j E [JV]) 
has a finite expectation, i.e., limsup^^ E[<3j(f)] < 00 , for all j. The crowd system is called stable under this 
policy. 

Definition 4. Cr, a closed subset of is the capacity region of a crowd system for a given distribution l of the 
agent-availability process if any A E Cr is stabilizable and any X f C,y is not stabilizable. 

Extension to combinations of multiple classes is not much different but requires more notation. 
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A. Capacity Region and Outer Region 

Let us characterize the capacity regions of different classes of crowd systems. For any given set of available 
agents u = [u\ : 1 < i < M l , 1 < l < L), we define the set of different types of tasks (a hS ) that can be allocated 
in a crowd system. Note that the necessary conditions to be satisfied for tasks to be allocated are specific to the 
class of crowd system. 

Using the explicit conditions ([!]), ([2]), and ([3]) for tasks (jobs) to be allocated, we define (7 FD (u), C' FND (u), 
C ID (u), and C IND (u) as the set of tasks that can be allocated in FD, FND, ID, and IND systems respectively for 
given availability u. We denote these sets generically by C(u) and refer to conditions FD, IFD, FND, and IND 
generically as crowd allocation constraint or CAC. 

C(u) := |(a JiS G Z + ) : 3 (z l js ) satisfying CAC j , 
and C(u) is the convex hull of C(u). 

The following theorem generically characterizes capacity regions of different crowd systems. 

Theorem 5. Given a distribution F of agent-availability, i.e., F(u) = Pr(U(t) = u), for a A (f C(T) there exists 
no policy under which the crowd system is stable, where 

C(r) = \ A = r ( u ) A ( u ) = M = E M? ’ A ( u ) G C ( u ) 

[ uezf i 

For FD, FND, and IND systems, for any A G C (T) there exists a policy such that the crowd system is stable. 

Proof: See Appendix Il|Aj ■ 

This implies that for FD, FND, and IND systems capacity region Cy = C(T) and for ID systems Cy C C(F) 
(possibly strict). Note that the conditions FD, FND, ID, and IND (generically CAC) are necessary conditions for a 
valid allocation. The above theorem implies these conditions are also sufficient, except for ID systems. In Sec. |VJ 
we present an alternate characterization of the capacity region for inflexible systems. 

Note C(r) depends on the distribution of agent availability T, but it is hard to obtain this distribution for large 
and quickly-evolving systems in practice. Flence, a characterization in terms of simpler system statistics is of use. 
Below is a characterization of a region beyond which no arrival rate can be stabilized. Borrowing terminology from 
multiterminal Shannon theory, we call this the outer region. 

For any set J C [ N ], define A f(J) = {( G [L\ : 3j G J s.t. ( j , l) G E} and the closed subset of , 

A:VJc[iV],V S ,^A^ )S < ]T 

[ jeJ ieN{J) i£M l 

Theorem 6. For any distribution F with mean p, Cy C C°“( 

Proof: See Appendix Hi ■ 

In general, Cr is a strict subset of because only captures the balance of skill-hours in the crowd- 
system, i.e. average skill-hours requirement is no more than average availability, but partial allocation of a task is 
not acceptable in a crowd system. Moreover, for non-decomposable jobs all tasks of a job have to be allocated 
simultaneously. Flence, meeting an average skill-hour balance criterion may be far from being sufficient for stability. 
For inflexible systems the requirements are even stricter, which is likely to increase the gap between the outer region 
and the true capacity region. In Sec. [V] we present a tighter outer region for inflexible systems. 

In certain scenarios Cjj" 7 ' may be non-empty when Cy is empty. For example, consider a simple non-decomposable 
crowd system with N = L = 1 and M 1 = S = 2. Let each job require 1 hour of both skills, type i agents have 
only 1 hour available for skill i and none for other skills, U 1 (t) be uniformly distributed on {(0,10), (10,0)}, and 
A = (4,4). Then clearly A G C'™ 1 . but note that at any time there is only one type of skill available, hence no job 
can be allocated. This implies Cr = 0- 
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B. Centralized Allocation 

Though there exists a policy for each A G Cp that stabilizes the system, these policies may differ based on 
A and may depend on the job-arrival and agent availability statistics. Changing policies based on arrival rate and 
statistics is not desirable in practical crowd systems due to the significant overhead. Below we describe a centralized 
statistics-agnostic allocation policy which stabilizes any A G Cp. Later we discuss computational cost of this policy 
for different classes of crowdsourcing system and present simpler distributed (or almost distributed) schemes with 
provable performance guarantees under some mild assumptions. 

To describe the scheme we introduce some more notation. Let Qj, s (t) be the number of s-tasks (skill s) of type 
j jobs just after the allocation epoch t — 1 and let I) hS (t) be the number of s-tasks (skill s) of type j jobs allocated 
at epoch t. Then 

Qj, s (t + 1) = Qj, s (t) + Aj(t) 1 (r jjS > 0) - Dj )S (t). 

Note that for all t, due to the CAC condition on allocation, Dj >s (t) G C(U(t)). Moreover, there is an additional 
restriction that Dj tS (t ) < Qj, s (t) + Aj(t), as there are Q hS (t) + Aj(t) part s of job type j in the system at that 
time, which in turn implies Qj )S (t ) G Z+ for all t. Note that as Dj tS (t) G C'(U(f)), for non-decomposable systems 
Qj,s = Qj,s> f° r all jt s ) s 'whereas for decomposable systems they may differ. 

Algorithm 1 Max Weight Task Allocation (MWTA) 

Input: {Qj >s (t) ■ j € [N], s G [S']}, A(t) and U(t) at t 
Output: Allocation of jobs to agents 

MaxWeight 

(zUt) ■■ l,j, s) = arg max V Q j>s (t)A jtS 

s.t. (zjj) satisfy CAC with aj tS = A j >s (t)\/j,s. 

Task Allocation 
for j = 1 : N do 

Order j-type jobs arbitrarily, Oj 

for s = 1 : S do 

Use order Oj among non-zero (j, s)-tasks 

1 = 1 

while l < L and Ylk2\ z j, s < Qj,s(t ) + Aj(t) do 

Allocate Efc=i z j, s '■ Ylk =l z k\ (a s ) tasks to category l. Here tasks [x : x + y] arc task set / = 
{[~x~|, • • • [x + yj} (in the ordering Oj), ([~af| — x) fraction of task [x] and 1 + x + y — \x + y\ fraction 
of task \x + y\. 
l<=l + 1 

end while 

end for 
end for 

for l = 1 : L do 

Order agents of category l arbitrarily 

for s = 1 : S do 

Agents pick maximum (as per availability constraint) tasks (or part) in order from min(zj s . Q hS (t) + 

Aj(t)) r j,s hours 

end for 
end for 


We propose the MaxWeight Task Allocation (MWTA) policy, Alg. 1, to allocate tasks to agents at epoch t 
based only on the knowledge of Q (t), A(t), and U(f), and therefore statistics-agnostic. It is based on MaxWeight 
matching [19]], [20j. 

It is apparent that the MaxWeight part of the algorithm finds a (zj s } that satisfies CAC. The following theorem 
implies that MWTA allocates tasks optimally. The proof of the theorem is based on adapting the proof of optimality 
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of the Max Weight algorithm under the constraints and assumptions of crowd systems. It implicitly relies on the 
following result. 

Proposition 7. For any u and Q, and {z l j s } satisfying CAC, the Task Allocation part of MWTA (Alg. 1) gives a 
feasible allocation for FD, FND, and IND systems. 

Proof: See Appendix IljC] ■ 

Theorem 8. MWTA (Alg. 1) stabilizes FD, FND, or IND crowd systems for any arrival rate A G Cr (for respective 

Cr). 

Proof: See Appendix Il|Dj ■ 


IV. Single-category Systems and Decentralized Allocations 


There is effectively a single category of agents in many platforms with a large population of new freelancers, 
whose reputations are based on evaluation tests for skills and who are paid at a fixed rate. Hence designing efficient 
allocation schemes for single-category systems are of particular interest, as this population of agents are significant 
in ever-evolving crowd systems. Insights drawn from single-category systems are also useful in controlling multi¬ 
category systems, Sec. [V] 

For a single category system (L = 1), note that zj s = a,j, s £ and hence the feasibility condition (jTJ) reduces 
to: 

22 a 3^ r j,s < 22 Uihi tS for all s £ [S] , a.j, s £ Z+, 

3 i 

with condition ([2]) additionally requiring a hS = a JtS > for all j,s,s'. Thus, C'(u) is the set of {aj. s } satisfying the 
above conditions for respective classes of jobs and C(T) is the weighted (by T(u)) sum of convex hulls of C(u)s, 
here Cr = C(T). 

C° nt has a simple characterization as well. As for any j 6 [ N ], (j, 1) 6 E, and Af(J) = 1 for all J C [N], 
Y^ieN(J) ^ie[M 1 } F\h\,s = IbAi.s- Thus it is sufficient to satisfy the inequality for J = [N], and hence, 

= : ^je[N] ^j r j — Ylie[M] 

The MaxWeight computation in MWTA for single-category systems turns out to be the following integer linear 
program (ILP), which is related to knapsack problems. 


arg max 2^Q js A js 

s>t ^2 a j,s r j, s < '22 Uih i)S \ls G [5], 

3 i 

Aj jS = AVs, s',j (only for ND) 


(5) 


For decomposable and non-decomposable systems, this is a single knapsack and multi-dimensional knapsack 
problem [34j, respectively, and hence NP-hard. There exist fully polynomial time approximations (FPTAS) for 
single knapsack, whereas for multi-dimensional knapsack only polynomial time approximations (PTAS) are possible 
[34]. With this approximation, say 1 — e, the MWTA policy stabilizes (1 — e)Cr = {A : j £ Cr}- Also, note 
that for large crowd systems each A, is large and hence stabilizing any A with A + 1 G Cr is almost optimal. The 
above ILP can be relaxed to obtain a linear program, an allocation based on which achieves this approximation 
(see Appendix I.A for details). 


A. Decentralized Allocations 

Now we show that due to the structure of the crowd allocation problem and the fact that crowd systems are large, 
simple allocation schemes with minimal centralized control achieve good performance under mild assumptions 
on arrival and availability processes. Interestingly, though the centralized optimal allocation requires solving a 
knapsack problem at each epoch and greedy schemes are known to be sub-optimal for knapsack problems [ 341, we 
propose two simple greedy schemes that are almost optimal with good performance guarantees. One of them, called 
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Greedy Agent allocation provably performs well for decomposable systems and offers the freedom of selection to 
freelancers. Another, called GreedyJob allocation has provable performance guarantees for both decomposable and 
non-decomposable systems while allowing customers (job requesters) the freedom of selection. Thus, in some sense, 
this shows that though greedy algorithms can be suboptimal for an arbitrary allocation instant (at each epoch), for 
a dynamical system over long time, its performance is good. 


Algorithm 2 Greedy Agent Allocation 
Input: A (t) 

Output: Job to agent allocations 
A: set of agents, T : set of tasks 
while A and T non-empty do 

Agents in A contend (pick random numbers) and a wins 
for each skill with non-zero skill hour do 
a picks as many integral tasks as it can pick 
if a has remaining available hour then 

a Picks from remaining parts of the partially allocated task 
if a has remaining available hour then 
a picks part of any unallocated task 
end if 
end if 

Remove fully allocated tasks from T 

end for 

A = A\{a} 

end while 

Tasks with partial allocations are not allocated 


In Greedy Agent allocation (Alg. 2), agents themselves figure out the allocation via contention, without any central 
control. Agents need no knowledge about the agent population, but do need information on the available pool of 
jobs and have to agree on certain norms. In most freelance market platforms, this information is readily available, 
and so an algorithm like this is natural. As expected, this scheme may not be able to stabilize any arrival rate in 
Cr for any ergodic job-arrival and agent-availability processes, but it has good theoretical guarantees under some 
mild assumptions on the job arrival and agent availability processes. 

Definition 9. A random variable X is Gaussian-dominated if E[X 2 ] < E[X] 2 + E[X] and for all 0 E M, 

E[e »(X-E W)] < exp | <WA-1-E[A T )P | 

Definition 10. A random variable X is Poisson-dominated if for all flel, E[e/ YYY ;: AD] < e ] -\ x }( e °- 1 ( 


Note that these domination definitions imply that the variation of the random variable around its mean is dominated 
in a moment generating function sense by that of a Gaussian or Poisson random variable^] Such a property is 
satisfied by many distributions including Poisson and binomial that are used to model arrival processes for many 
systems, e.g., telephone networks, internet, call centers, and some freelance markets [;5|, |6ij. It is not hard to 
show that sub-Gaussian distributions (standard in machine learning [361) that are symmetric around their mean, are 
Gaussian-dominated. 

The following theorem gives a guarantee on the performance of Greedy Agent, under mild restrictions on the job- 
arrival and agent-availability processes. Independence assumptions are not too restrictive for large crowd systems, 
where jobs and agents may come from different well-separated geographies or organizational structures. 


Theorem 11. If the arrival processes {Aj(t)} and the agent availability processes {U,(t)} are i.i.d. across time 
and independent across types {jobs and agents) and all these processes are Gaussian-dominated (and/or Poisson- 
dominated), then for any given a E (0,1], there exists an N a such that GreedyAgent allocation stabilizes any 


^Domination in this sense is used in bandit problems 


35 
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arrival rate A € (1 — a'jC""' : = |A : j- 1 ^ A G C'jf J 1 for any single-category decomposable crowd system with 
N > N a . Moreover, for any arrival rate in (1 — ojCff 1 , at steady state, after an allocation epoch, the number of 
unallocated tasks is ()(S log N) with probability 1 — o j)• 

Proof: See Appendix IljEj ■ 

As Cr C C'f u , this implies that the greedy scheme stabilizes an arbitrarily large fraction of the capacity region, 
under the assumptions on the arrival and availability processes. As S = o(N), more specifically 0(N C ) for c < 1, 
the above bound on number of jobs imply that there are o(N) unallocated tasks at any time. This in turn implies 
that unallocated tasks per type (average across types) is o(l), i.e., vanishingly small number of tasks per type are 
unallocated as the system scales. 

In GreedyAgent, there is no coordination among agents while picking tasks within jobs. Hence in a non- 
decomposable system, many tasks may be picked by agents but only few complete jobs are allocated. As more and 
more jobs accumulate, the chance of this happening increases, resulting in more accumulation. This can result in 
the number of accumulated jobs growing without bound, as formalized below. 

Proposition 12. There exists a class of non-decomposable crowd systems with Poisson-dominated (as well as 
Gaussian-dominated) distributions of arrival and availability, such that the system is not stable under GreedyAgent 
allocation. 

Proof: See Appendix IljF] ■ 

Hence, we propose another simple greedy scheme that works for both decomposable and non-decomposable 
systems. The GreedyJob allocation scheme (Alg. 3) is completely distributed and hence a good fit for crowd 
systems. GreedyJob has similar performance guarantees for both decomposable and non-decomposable systems as 
GreedyAgent has for decomposable systems only. 


Algorithm 3 GreedyJob Allocation 
Input: U(f) 

Output: Job to agent allocations 
J\ set of all jobs 

while Available skill-hours of agents and J / 0 do 
Jobs in J contend (pick random numbers) and J wins 
if J finds agents to allocate all tasks then 
Allocate to those agents 
else 

J does not allocate anything 

end if 

J = J\{J) 

end while 


Theorem 13. If the arrival processes {Aj(t)} and the agent availability processes {11,(1 )} are i.i.d. across time and 
independent across types (jobs and agents), all these processes are Gaussian-dominated (and/or Poisson-dominated) 
and \/s, s' p,ihi jS — - P;hi,s' \ is 0(subpoly(N )), then for any given a G (0,1], 3N a such that GreedyJob 

allocation stabilizes any arrival rate A G (1 — a)C° n< := j A : pz^A G C'jf ' | for any single-category crowd-system 
with N > N a . Moreover, for any arrival rate in (1 — ajC])" 7 , at steady state, after an allocation epoch, total number 
of unallocated jobs (adding all types) is 0(\ogN) with probability 1 — o (^W). 

Proof: See Appendix IljGj ■ 

In Sec. [V] we propose a decentralized scheme for multi-category systems that uses the two single-category 
decentralized schemes as building blocks. At the end of Sec.[V]we briefly discuss the suitability of these decentralized 
schemes for crowd systems in terms of implementability on crowd platforms. 
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V. Multi-Category Systems 


Sec. Ill characterized the capacity region and developed an optimal centralized scheme for crowd systems in, 


whereas Sec. IV discussed simple decentralized schemes for single-category systems. Here we return to multi¬ 
category systems, briefly discussing computational aspects of MWTA, followed by an alternate approach to the 
capacity and outer region of inflexible systems that yields a simple optimal scheme. We also present a decentralized 


scheme based on insights from the optimal scheme and the decentralized allocations in Sec. IV 


The MWTA scheme, which is throughput optimal for FD, FND, and IND systems, involves solving an NP-hard 
problem for multi-category systems. For multi-category systems this is from the general class of packing integer 
programs, for which constant factor approximation algorithms exist under different assumptions on the problem 


parameters [371. These assumptions do not generally hold for Max Weight allocation under the CAC constraint. 


Rather, we follow the same steps of LP relaxation and obtain a scheme that stabilizes any A for A + 1 G Cr, since 
for large systems this is better than any arbitrarily close approximation scheme (as A —> oo as N -A oo). 

A. Inflexible System 

Below we present a characterization of the capacity region of inflexible systems in terms of the bipartite graph 
G = (V, E), which captures the restriction of job-agent allocations. 

Theorem 14. Any A can be stabilized if A G C 1 , where 


C 1 = 


A = J2 a(0 : a(0 e 4?> A 5 = 0 f° r al1 O', 0 i E 

ie[L\ 


where Cy' is the capacity region of a single category system with an agent availability distribution T; = F (Uj : i G [M 1 ]). 
Moreover, no A f C 1 can be stabilized, i. e. , for inflexible systems the capacity region Cr = C 1 . 

Proof: See Appendix IljHj ■ 

This theorem has the following simple consequence. Consider separate pools of agents for each different category, 
cf. [71, which has agent-availability distributions {T; : l G [LIT. Each such pool (category) of agents l can stabilize 

”—” °(l) 

job-arrival rates in Thus if the job arrival process of each job type j can be split in such a way that pool 
(category) l of agents sees an arrival rate A 1 -, where A l - > 0 only if (j, l ) G E, while ensuring that { A I J : j } G c|P, 
the system would be stable. 

In a server farm where jobs can be placed on any of the server queues, the join-shortest-queue (JSQ) policy 


stabilizes any stabilizable rate 201. JSQ gives an arriving job to the server with the shortest queue and each server 


serves jobs in FIFO order. For multi-category crowd systems, we can draw a parallel between servers and agent 
pools. In addition we have constraints on job placement given by G and also have to do allocations of jobs among 
the agents in the pool optimally (unlike JSQ we do not have FIFO/LIFO specified). Thus we have to adapt JSQ 
appropriately based on our insights about optimal operation of crowd systems. 

We propose a statistics-agnostic scheme, JLTT-MWTA (Alg. 4) that has two parts: JLTT (join least total task) 
directs arrivals to appropriate pools of agents and MWTA allocates jobs in each pool separately. Letting Q l - s (t) be 
the number of unallocated (j, s)-tasks in /th pool just after epoch t — 1, JLTT uses these quantities to direct jobs 
to appropriate pools whereas MWTA uses them to allocate tasks within each pool. 

The JLTT part is computationally light. The central controller only needs to know Q(f) and has to pick the 
minimally loaded (min; Q l - s ) pools of agents to direct jobs (type j). To perform MWTA in each pool, a PTAS, 
LPTAS, or LP relaxation scheme can be used. 

Unlike JSQ, where service discipline in each server is fixed and the goal is to place the jobs optimally, we have 
jobs with multi-dimensional service requirements from time-varying stochastic servers (agent-availability) and have 
to place jobs as well as discipline the service in each random and time-varying virtual pool. Thus optimality of 
JSQ cannot be claimed in our case. But as stated below, JLTT division followed by MWTA allocation is indeed 
optimal. 

Theorem 15. JLTT-MWTA stabilizes any A G Cf. 

Proof: See Appendix nffl ■ 
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Algorithm 4 JLTT-MWTA: Divide and Allocate 
Input: A(i), U (t), Q (t) 

Output: Job division and allocation 

Create pool l with category l agents (VZ E [L]) 

JLTT: Join Least Total Task 
for each (j, s ) do 

Count number of unallocated (j, s)-tasks in pool l: Q l - s (t) 

Divide Aj(t)l(rj jS > 0) tasks equally among pools argmin/.^-q^ Y^ s Q l j s (t) 

end for 

In each pool l run MWTA for single-category system 


An important aspect of JLTT-MWTA is that job allocations within each pool can happen independently of each 
other. The central controller only has to make a decision on how to split the jobs based on the current system 
state information. This allows a more distributed allocation along the lines of following hierarchical organizational 
structure [38]. First, the central controller divides jobs for different agent-pools based on {Q l j s }. Then in each agent 
pool, allocations are according to GreedyJob allocation, which works for both decomposable and non-decomposable 
single-category systems. The distributed scheme that we propose here is an improvisation of the above JLTT scheme 
followed by GreedyJob allocation in each pool. We call it Improvised JLTT and GreedyJob Allocation (Alg. 5). 


Algorithm 5 Improvised JLTT and GreedyJob Allocation 
Input: A(t), U(f), Q(t) 

Output: Job division and allocation 

Improvised JSQ for each job-type j and each skill s: 

n = 0 

N ) = Qtt) 

While n < Aj{t ) 

Send 1 task to the category l* with lowest 
index among arg min;.^^ eB A r j 
Increase iVj and n each by 1 

End While 

Allocations within each pool l: 

Run GreedyJob allocation 


First note that unlike JLTT-MWTA, here we only maintain number of unallocated jobs and do not maintain 
number of unallocated tasks for each skill s. This is because as GreedyJob allocation is used as allocation scheme 
in each pool, Q l - s = Q l - s , for all s, s'. 

This algorithm is also simple to implement. The central controller only needs to track the number of unallocated 
jobs ( Q l At )) from the previous epoch and set JVj = Q 1 -. For any arriving job of type j, the central controller sends 
the job to the pool with minimum N- and updates N-. This continues until the next epoch, when the N- are reset 
to new Q l j values. 

Recall that Sec. |T] gave a simple example of a fully distributed scheme where jobs pick agents greedily (from 
the set of feasible agents as per G) and showed it was not a good scheme. Improvisation of JLTT is proposed for 
a better performance guarantee, while GreedyJob in each pool is proposed for implementability and freedom of 
selection for customers. It is not hard to prove Improvised JLTT followed by MWTA is optimal for any arrival and 
availability process satisfying the assumptions of Sec. [TT] Below we present performance guarantee for Improvised 
JLTT and GreedyJob allocation. 

To present performance guarantees of the distributed scheme we give an outer region C° for the system, along 
the lines of the alternative characterization C 1 of the capacity region Cr for inflexible systems. 


Theorem 16. Inflexible crowd systems cannot be stabilized for A f C°, where C° = j A : A = Yhe[L] ^ where X (r> E C" 1 ,' 1 
and C'p 1 is the outer region for the single category system comprising the Ith category (pool) of agents with 
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n l =E [V 1 ]. 

Proof: See Appendix IlJj] ■ 

For job allocation in server farms, extant performance guarantees are mostly for symmetric load, i.e., symmetric 
(almost) service and job arrival rates and regular graphs, cf. 1301, (3TJ. Unlike server farms, symmetric load (in 
terms of skill-hours) is not guaranteed in crowd systems by symmetric arrival rates and graphs. This is because 
different types of jobs have different skill and hour requirements. The following guarantee for crowd systems is 
for bounded asymmetry (sub-polynomial variation) in agent availability, complete graph, asymmetric job arrival 
rates, and asymmetric job requirements (extendable to regular graphs with additional assumptions on symmetry of 
job arrival rates and requirements). Note that because of the inflexibility constraint, a multi-category system with 
a complete graph is not equivalent to a single-category system. 

Theorem 17. Without loss of generality assume the same ordering of agent types in each category, i.e., M l = 
M l = M/L and h\ = h\ for all l, l', i. If the arrival processes {Aj(t)} and the agent availability processes {U^t)} 
are i.i.d. across time and independent across types (jobs and agents), all these processes are Gaussian-dominated 
(and/or Poisson-dominated), X^ max U' 1/4 ~ di\ anc ^ max fs,s' I X4/4(^4,s — h\ s ,)\ are O (subpoly(N)) and G is 
complete bipartite, then for any given a E (0,1], 3N a such that Improvised JLTT and Greedy-job stabilizes any 
A G (1 — a)C° := | A : pj^A G C° j and the maximum number of unallocated jobs (across all types) is 0(logN) 
with probability 1 — o (-^ 2 ). 

Proof: See Appendix Il{K| ■ 

The proofs of Thm. |TT] [13} and 17 are all based on constructing queue-processes (different for the algorithms) 
that stochastically dominate the number of unallocated jobs, and bounding the steady state distributions of these 
processes using Loynes’ construction and moment generating function techniques. 


B. Implementation of Decentralized Schemes on Crowd Platforms 

We have described the allocation schemes at the level of system abstraction and discussed their performance. 
These schemes can be easily implemented on crowd platforms as well. 

Greedy Agent allocation is completely decentralized, only requiring agents to abide by a norm for picking partial 
tasks, which can be enforced by randomized vigilance and penalizing norm violators in reputation. If the payments 
are the same, as it generally is in single-category systems where all jobs require same quality, there is no incentive 
for agents to deviate from the norm. 

Any arbitrary contention method among agents will work for the algorithm, and hence the crowdsourcing platform 
only needs to ensure that no two allocations are done simultaneously (as practiced in airlines booking). Multiple 
allocations can also be allowed by the platform if they do not conflict. Here the platform has to ensure that an 
agent can place requests only for an amount of tasks it can actually perform, given the constraints on available 
hours. Also, only one agent can request for a task or a certain part of it. Once the agent has been declined, it can 
place request(s) for task(s) of the same or lesser hours. This can either be enforced by appropriate modification 
of the portals by keeping tracks of total hours of requests placed or by vigilance. GreedyJob can also be easily 
implemented on a crowd platform. The platform has to ensure that jobs request agents and not the other way 
around. One way to implement this is to allow jobs to place requests for agents while ensuring they do not request 
more than the required service. Also, the platform has to ensure that skill-hour requests of no two jobs collide. 
This again can be ensured by serializing the requests as above. Agents are expected to accept a requested task, as 
there is no difference between tasks involving same skill since payments are the same. This can also be ensured 
by linking agent rating to rate of task-request acceptance. 

It is apparent that GreedyJob offers choice to customers and Greedy Agent offers choice to agents. By allowing 
a customer (or an agent) to decline an approaching agent (or a customer) request and to explore more options, 
only one option at a time, the platform can provide freedom of choice to agents and customers under both schemes 
while operating at capacity. 

In case of multi-category systems, the platform only needs to direct arriving jobs to the appropriate pool of 
agents, based on current backlog; the rest of the allocation happens as per GreedyJob. Directing a job to a category 
of agents can be implemented in a crowd platform by making the job visible only to freelancers of that category and 
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vice versa (similar to filtering done by search engines and online social networks) or through explicit hierarchical 
organization into pools [38]. 


VI. Beyond the Capacity Region 

We have now characterized the capacity (and outer) regions of different classes of crowd systems, shown the 
existence of computationally feasible centralized schemes that achieve these regions, and presented simple distributed 
schemes with minimal centralized intervention and good performance guarantees for any arrival rate within the 
capacity region. In crowd systems, however, arrival rates may not be within the capacity region, since the platform 
may have little or no control on resource (freelancer) planning, unlike traditional communication networks or cloud 
computing systems. Hence, an important aspect of crowd systems is to turn down job requests. Indeed, deciding to 
decline a job must be done as soon as the job arrives because dropping a job after first being accepted adversely 
affects the reputation of the crowd platform. 

Here, we propose a centralized scheme for a crowd system to decline jobs on arrival in a way that is fair 
across all job types. Our scheme is statistics-agnostic and works even for independent but non-stationary arrival 
and availability processes. 

We solve the following problem: given an arrival rate A, design a statistics-agnostic policy to accept (1 — (3 )A 
jobs on average and allocate them appropriately such that j3 is the minimum for which the crowd system is stable. 
Note that if A £ Cr, the minimum (3 is 0, else, it is strictly positive. We want to design a statistics-agnostic policy 
without the knowledge of A and Cr- As a benchmark, we consider the following problem for e > 0, when A and 
Cr are known. 


min [3 £ [0,1] s.t. (1 — (3 )A + el £ Cr- 


( 6 ) 


Given (3*, optimum of ([6]), (1-/3* )A is within e of the optimal rate of accepted jobs for which the system is 
stabilizable. 

As we want a scheme that is agnostic of A and Cr, we propose the following simple scheme, for v > 0 and 
Q l j s (t) is the number of unallocated accepted (j, s)-tasks ((j, .s)-tasks directed to category l) in the system. 


/3(f) = arg min 
/3e [o,i] 


j P A j(t) - "P £j )S :r,,.>0 QjAWjit) W 

{P A i(t) - V P £j )S :r,,.>0 min Z QjAWjQ) (H) 

Job is accepted w.p. 1 — /3(f), accepted jobs A(f) (I & II) 

f MWTA (I) 

For accepted lobs run < 

I JLtT-MWTA (II) 


(7) 


Steps marked by I (II) are applicable for FD, FND and IND (ID and IND) systems. 


Theorem 18. Crowd system with jobs accepted and allocated according to Q is stable and JV Aj(l — 13* ) — 
ji ^ t=1 E[JA A j( f)] can made arbitrarily small for an appropriately chosen u, for all sufficiently large T. 


Proof: See Appendix 110 ■ 

This theorem demonstrates that by following the job acceptance and allocation method |7]), the crowd system 
can be stabilized while ensuring the average number of accepted jobs per allocation epoch is arbitrarily close to 
the optimal number of accepted jobs per allocation period. Note that as all jobs (across all types) are accepted with 
the same probability, the above result also implies that (1 — /3*)A j — p YPt=i E[Aj(f)] is small. It can be shown 
that the above scheme works for time-varying systems (E[A(f)] = A(f) and U ~ r*_Q) as well guaranteeing small 
limr^oo p YPt=i Ylj (Aj(f)(l — /3*(f)) — E[Aj(f)]V where /3*(f) is the solution of (6) for A = A(f) and Cr = Cr*. 


sectionConclusion 

Human information processing, structured through freelance markets, is an emerging structure for performing 
informational work by capturing the cognitive energy of the crowd. It is important to understand the fundamental 
limits and optimal designs for such systems. 
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In this work we provide a characterization of the work capacity of crowd systems and present two statistic-agnostic 
job allocation schemes MWTA (flexible jobs) and JLTT-MWTA (inflexible jobs) to achieve limits. To ensure low 
computational load on the crowd platform provider and freedom of choice for job requesters, we present simple 
decentralized schemes, Greedy Agent, GreedyJob, and Improvised JLTT-GreedyJob that (almost) achieve capacity 
with certain performance guarantees. These decentralized schemes are easy to implement on crowd platforms, 
require minimal centralized control, and offer freedom of self-selection to customers: all desirable qualities for 
any crowd platform. Due to quick evolution and unpredictability of freelancer resources, crowd systems may often 
operate outside capacity, which inevitably results in huge backlogs. Backlogs hurt the reputation of the platform, 
and so we also propose a scheme that judiciously accepts or rejects jobs based on the system load. This scheme 
is fair in accepting jobs across all types and accepts the maximum number of jobs under which the system can be 
stable. 
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Appendix I 

A. Computation for Centralized Allocation 

For a single category system (L = 1), note that zj s = a : j yS E Z + and hence the feasibility condition (|T[) becomes 

- X] Uihi^ s for all s E [S'], a.j, s E Z H 




with condition ([2]) additionally requiring a JyS = aj >s > for all j,s,s'. Thus, C'(u) is the set of {aj, s } satisfying the 
above conditions for respective classes of jobs (as well as systems) and C is the weighted (by T(u)) sum of convex 
hulls of C(u)s. 

C.'™ 1 has a simple characterization as well. As for any j E [ N ], (j, 1) E E, and A f(J) = 1 for all J C [ N ], 
VeAf(J) Ylie[M l ] I jI Jg = YlieiM] Thus it is sufficient to satisfy the inequality for J = [N], and hence, 


C out fL = 


x ■ J2 Vi < » ihi 

is [TV] ie[M] 

The Max Weight computation in MWTA for single-category decomposable systems turns out to be the following 
integer linear program (ILP), which is related to knapsack problems. 

arg max V Q, ,A, 5 


s.t. J2' 


3,s 

lj,sTj,s — ^ ] Ujllj y fJS E ['S'], 


(8) 


This problem is an integer program, hence it is not clear whether this problem can be solved efficiently at all 
instants. In fact, it is a so-called unbounded knapsack problem for a given u and Q. This problem is known to 


be NP-hard [34|. There is a pseudo-polynomial algorithm based on dynamic programming which solves it exactly, 
but the runtime may depend on Qj, s . This dynamic programming-based algorithm can be converted into a fully 
polynomial time approximation schemes (FPTAS) which achieves any (1 — e) approximation of the problem in 
poly(7) computations. Moreover, there exists faster greedy algorithms that achieve \ approximation and can be 
converted into a polynomial time approximation scheme (PTAS) that achieves (1 — e) approximation in poly (n 7) 
computations. Thus we can conclude that though the MWTA algorithm is computationally hard for single-category 
decomposable system, there exist efficient approximation schemes. It is not hard to show (Prop. [T9] below) that an 
algorithm that gives (1 — e) approximation of the optimization problem in MWTA can stabilize any A for which 

(1—e) • 
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For single-category non-decomposable systems, feasibility condition ([2]) of an allocation of a,j jobs of type j is 
given by, 

^ ^ ^ Ujhj.s^s G [5*] , ctj G Z_j_. 

j i 

Hence the stabilizable region changes accordingly to 


C r = |^r(u)A(u) : A(u) gC^u) j 


C JVD (u) = conv < a JjS G Z + : o-jfj,s < and dj )S = dj\/s > . 


where conv{-} is the convex hull. 

Hence, in this case, the MWTA allocation needs to solve 


argmax^ ( J] Q i;S A j>s 


s.t. 


Ea 


A ^ ^ Ujhj iS , A 


, 3iS = A j tS /, for all s,s' G [5], 


(9) 


and then divide the allocated jobs arbitrarily among agents while meeting their per skill time-availability, as there 
is only one category of agents. 

This problem is also a knapsack-like integer program, with an additional constraint that the number of (j, s)-items 
has to be the same as the number of (j, s'j-items for all j, s and s'. Such a problem is called a multi-dimensional 
knapsack problem. This problem is also NP-hard. Moreover, provably there cannot exist a fully polynomial time 


approximation scheme for this problem [341. 


For a multi-dimensional knapsack problem, there exists an approximation scheme that achieves an approximation 


factor equal to the dimension d [341. This approximation scheme can be converted into a PTAS with complexity 
O (N K ) and an approximation factor of 1 — e, where k is strictly increasing with dimension and In the case of 
our setting, the number of skills S (dimension) is large and may scale with N, hence this scheme is not suitable. 

Though the total number of skills S can scale with N, in most cases the number of skill-parts that a type j 
job has is a constant, i.e., r ; has most coordinates as 0. Thus it is apparent from the objective function that the 
optimal choice of A jyS is 0 for the corresponding coordinates s. This allows us to rewrite the optimization problem 
as another multi-dimensional knapsack problem with constant dimensions given by rnaxj |{s : r :j , s > 0}|. 

For this problem we can use the PTAS to obtain arbitrarily close approximation and hence can stabilize a 
rate-region arbitrarily close to C ND . But the complexity of this algorithm is very high as complexity scales 
super-exponentially (N k ) with the approximation factor (unlike decomposable systems where we have an FPTAS, 
polynomial in -). 

Note that our goal is not to solve Q optimally, but to have a fast allocation scheme that can stabilize a large 
fraction of C ND . In this regard we can take a different approach that exploits basic characteristics of a crowd 
system. Since N and A j(N) for j G [N] are large in most crowd systems, if an allocation scheme stabilizes any 
rate A for A + cl G C, then it stabilizes ^1 — max* x-(N) ) N° te that as A i(N) scales with N, this implies that 
such a scheme would stabilize almost all of C. Motivated by this, we propose the following allocation scheme, 
which is a modification of ([9]). 


{**’•’} = (£%*J 

s.t. Xjfjg < Ujh ly s- for all 


( 10 ) 


and allocate A j = [£j J jobs of type j to the agents, splitting arbitrarily while meeting time-availability constraints. 
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Note that since in ©, the variables are relaxed to R from Z, Yhj (YlsQjA &j — Ylj (A2 S QjA z r Again, 
A' = LA'J — % — 1’ hence JA Qj,s) A-,- > Qj,s) (% — !)• 

The following proposition guarantees that a proposed LP-relaxation scheme stabilizes any A with A + 1 € C ND . 

Proposition 19. Let V be an allocation scheme that at epoch t does an allocation {A(t)} instead of {A(f)} of 
the MWTA allocation scheme, which satisfies 

— y ^ Qj,s(t)^j,s(t) + y ] Qj, s (t)s, 

j,S 3,8 3,8 

or, 


(i e ) ^ ^ Qj,s(t)Aj,s(t) a y ' Qj,s{t)Aj tS (t), 

3,8 3,8 

stabilizes any rate A £ C if X + 61 € C or y- 1 -^ A e C respectively. 

Proof: This proof follows the same steps as the proof of Thm. [8] We first prove the result for an allocation 

with 

y < y ^ Qj jS (t)Aj !S (t) + y ' Qj s (t)5. 

3,8 3,8 3,8 

as follows: 


E[L(Q(f + l))-L(Q(t))|Q(t)] 


< E 


3,8 


-E 


J2Qbs(t) (A i;S (t) - a) |Q(t) 

3,8 


+ E 


Y {tfA) + (A j,. 

3,8 


6) 2 ) |Q(t) 


We can bound the last term as above, because (A j tS — 6) 2 < A 2 s + 6 2 . Thus, 


E [L (Q (t + 1 ))-L ( Q{t )) |Q(t)] < B + E 

YQiAWMV + Am) 

-E 

'y2Q j! s(t)A j , s (t)\Q(t) 


. i' s 




Now note that if A is such that A + 61 E C, then we can write it in terms of convex combinations of d € C(u) 
and follow the same steps as in the proof of Thm. [8] 

Similarly for the other case of constant factor approximation we have: 


E [L (Q(f + 1 ))-L (Q(t)) |Q(t)] < B + E 

J2QjAt)Aj(t)\Q(t) 

-E 

Y Q 3 A*) A A*) i 1 - e )IQ (*) 


. i' s 


. i' s 


as (1 - e) 2 A 2 s < A\ s . 

If A e (1 — e)C, then by definition of C and (1 — e)C, there exist z/(u) E C(u) and {dfc(u) E C(u)} such that, A < 
(1 - e)i/(u)T(tt) and Jf k y k d k = u(u) for y k > 0, Jf k y k = 1- As for any d fc for a given u, ffj. s Qj,s(t)^j,s(t) < 

the result 


Ej, s Q3At) d j, S ’ Ej, sQiAt)Xj < e J2j, s Q3At)A jtS {t)\Q(t) 


J],S ~63,8 

follows. 


. Then following the proof of Thm. 


Appendix II 


In this appendix, we present proofs of the main results in Secs. Ill VJ As mentioned earlier, most of these results 
extend to systems with stationary and ergodic arrival and availability processes, but here we only present results 
for i.i.d. processes. 
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A. Proof of Theorem [5] 

Here we only prove the converse part, i.e., A f C cannot be stabilized by any policy. For the direct part, it is 
sufficient to prove there exists a scheme that stabilizes any A £ C, and so the proof of Thm. [ 8 ] below is sufficient. 

First we prove that C is a convex subset of K+. If A, A' G C, then there exist (A(u) G C(u) : u G and 

(A'(u) 6 C( u) : u G such that 

Y r ( u ) A ( u ) = A ’ Y r ( u ) A '( u ) = A/ - 

u u 

Thus for any 7 G [0,1], 

7 A + (1 - 7 )A' = Y r ( u )(7 A ( u ) + (1 - 7) A '( U )- 

U 

Note that C( u) is convex since it is the convex hull of C(u); hence 7 A(u) + (1 — 7 )A'(u) G C( u), which in turn 
implies 7 A + (1 — 7 )A' G C. This proves convexity of C. 

Thus C is a closed convex set. Hence for any A° f C, there exists a hyperplane h T x = c that separates C and 

A°, i.e., there exists an e > 0 for any AgC such that h T A° > h 1 A + e. 

Hence under any policy: 

E [h T Q(t + 1)] = E [h T (Q(f) + A (t) - D(f))] 

= E [h T |Q(f) + A(f) — A(f)| + ] 

where A (t) is the number of possible departure under the scheme if there were infinite number of jobs of each 
type, and | • | + is shorthand for max(-, 0). As |x| + is a convex function of x, h T |Q(£) + A (t) — A(f)| + is a convex 
function of Q(t), A(t), and A (t). Thus by Jensen’s inequality: 

E [h T |Q(t) + A {t) - A(f)| + ] > h T |E [Q (£)] + E [A (£)] - E [A(f)]| + 

Note that any A is a r(u)-combination of some {A(u) G C(u)} and any A(u) is some convex combination of 
elements of C'(u). Also, from the allocation constraints it is apparent that if a G C'(u) then also a' G C'(u) if 
a' < a. These two imply that for any A G C, if there exists a A' < A (component-wise) and A' > 0, then A' G C. 
That is C is coordinate convex. This in turn implies that for any \° f C there exists an h f 0 G such that 
hyperplane separation holds for this h. Thus for h > 0: 

E [h T Q(t + 1)] > h T |E [Q (£)] + E [A (£)] - E [A(f)]| + 

= Y \ h J E [QiW] + hj'E [.Aj(t )] - hjE 
3 

> Y (^i E IQ3W + h 3 E [MW ~ h J E [Aj(f)]) 

3 

> E [fi r Q(£)] + h T E [A°] - sup h T A 

A ec 

> E [fi r Q(f)] + e 

Thus we have E [/i 7 Q (t + 1)] —> 00 . As h > 0, this implies there exists j such that E [Qj(t)\ —> 00 as t —> 00 . 
Hence, the system is not stable. 

B. Proof of Theorem [ 6 ] 

Consider the dynamics of Qj^ s (t), the unallocated (j,s) tasks at the end of epoch t. 

Qj,s(t + 1) = \Qj,s(t) + Aj(t) — Dj jS ( f)| + 

7 Qj,s(t ) + Aj(t) — Dj )S (t) 

t 

pYiMV-DUt)). 

k =0 


19 


As rj jS > 0, 


E [ r j,sQjA t + 1 )1 ^ E E [Aj(t)r jtS - r jjS D jjS (t )] 

k =0 
t 

= y~i Aj r j,s — r j,s e [-Dj )S (t)]) 


/c=0 

Consider any set J C [iV], then at any epoch t, to schedule a certain number of tasks of type (j. s), the system 
needs that much available usable skill-hours. This follows from conditions <|T]) and ([2]) and can be written as: 

e r j,sDj,s(t ) < e e ee* - 

j&J leN(J)ie[M l ] 


This in turn implies 


Hence, 


y~] r j,gE [Dj, s (t)] < e y^ h i is p is . 

j&J leN(J) ie[M l ] 


E 


E r 3isQi,s{t + 1) 
jeJ 


— EI EVj,- E 

fc=o \jeJ jeJ 

^e(ev.>- E E EE 

fc=o \jeJ leAf(J) ie[M l ] 


( 11 ) 


For any A <f C , by definition there exists a J C [N] such that J2jeJ^j r j,s ~ J2ieN(J) J2ie[M l ] K, 8 Pi, 3 > °- 
Thus in that case, limsup^^ E Yhj^j r j,sQj,s(t + 1) = oo. Note that since J is finite and so is nraxj r hS , there 
exists a j F J such that limsup t ^. 00 E [Q JyS (t + 1)] = oo. This shows the system is not stable for A f. C out and 
proves Cr C C out . 


C. Proof of Proposition [7] 

We consider FD, FND, and IND cases separately. 

FD: The Max Weight part chooses z l - s to be integral which implies that integral number of tasks can be allocated 
if done appropriately. As hours are allocated from tasks in order, a task later in the order only gets allocated 
(partially or fully) after the tasks before it are fully allocated. This leads to no partially-allocated tasks. 

FND: Same ordering is used for all (j, s ) tasks and Max Weight chooses z l - s such that a hS = a J:S > (as it satisfies 
FND) and hence if an s-task of a job is chosen then also s' is chosen for r ]yS . rj yS i > 0. 

IND: Same ordering is used for all ( j , s) tasks, allocations to different categories arc in same order (1=1 to L ) 
and Max Weight chooses z l - s such that z l - s = z'- s , for all l, V (as it satisfies IND), hence if a task of a job is 
allocated to category l then so are the other tasks. 


D. Proof of Theorem [S] 

Note that Max Weight chooses an allocation {A JiS (f)}. But, the maximum number of (j, s)-tasks that can be served 
is Qj jS (t) + Aj(t). By Prop.[7j Task Allocation does a feasible allocation for FD, FND, and IND systems. Also, note 
that in the Task Allocation algorithm, the number of allocated (j, s)-tasks is Dj >s = min ( /\ JS (f), Q ]yS (t) + Aj(t)). 
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Consider the usual Lyapunov function L (Q) = V ^ g Q? g . 
E[L(Q(i + l))-L(Q(t))|Q(t)] 

= E Y,(QUt + i)-QlM\Q(t) 

. i’ s 

= E ]T ((QiAQ + MQ-D^y-QZ^t)) |Q(f) 

_ j,s ^ ' 

— E E] (^(Qj,s{i) + Aj(t) — Aj's') — Qj s (t)^j |Q(f) 


E ^Q j;S (f)^(f)|Q(f) -E J^Q jtS (t)A jiS (t)\Q(t) +E £(^(t) + A 2 


We bound the last term first. 




J]E[A 2 ]+E £M„ 


< E e A 2 i + min( 1 , 0) E £,,A, 

Ac \ J 5 / \ n a 


i + mm(rj 1 , >0) E eie^-W 


j, s v j>o y l V i ’ 1 v s 

£E E ^]+ V mi „(r,„>0) "if 4^ 


z \2l M(M + 1) 
V 2 


( 12 ) 

(13) 


This is a constant B < oo independent of Q, as E[A|] and E (C //) 2 are finite for all j, Z, i. 

To bound the first term, note that if A Tel (E (J, then there exist (E such that A j ^ E u r(u) z/(u) — e 

for all j G [J], Again note that as C(u) is the convex hull of C(u), z/(u) = Yhk 7 fcd fc (u) for some {d fc (u) G C(u)} 
and 7 ^ > 0 with )T fc 7fc = 1 . So, 

E iqw = 

_ j,S J j,s 

— 'y y QjjsityVj — e ^ ^ 
j,s j,s 

< ^2QjA t )^2 r ( u )^2 r ykdj(n) ~eY^Qj,s 

j,s u /c j,s 

= E r ( u )E^>wE^( u ) ~ e E^- s 

u j t s /c j»,s 

< E r ( u ) ma A , E Qr,Ai)^,. s (u) - e JE <2j,« 

d U Gc U 

u J,s 3,s 

= E ^2Q jjS (t)A jjS (t)\Q(t) -eY^Q j)S - 
j,s i,s 
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Thus, we have a bound on the Lyapunov drift, 

E [L (Q(t + 1 ))-L (Q (t)) |Q(t)] <B-e^ Qj,*- 


Hence, 

T—l 

E[L(Q(T))-L(Q(0))]<£?T-e EE ti^E^t)]. 

t=0 j,s 

As L(Q(0)) < oo and L(Q) > 0 for all Q, we have that for all T, 

^EE 6 [Q,,,(()]< | + M < 

t= 0 j,s 

This in turn implies limsup^^ s E[Qj jS (f)] < oo, otherwise the time-average cannot be finite. This implies 
that for all (j,s) with rj jS > 0, limsup^^ E[Qj )S (i)] < oo. 

Again note that Qj(t) < Qj, s , as there can be unallocated jobs with more than one part unallocated. Hence, 

limsup t _ > . 0O E[Q. / (i)] < oo for all j G [N]. 


E. Proof of Theorem 11 


In the Greedy Agent algorithm, as each agent with available skill-hours greedily chooses to serve a task, no (j. s ) 
task of size r can remain unallocated if there is an agent (or agents) with s skill-hour (total) of at least r. Since at 
each allocation epoch a task should either be allocated totally or not at all (i.e., x < r hours cannot be allocated), 
it may happen that some agent hours are wasted, as that does not meet the task allocation requirement. 

Note that since any job requirement is less than r = max J)S r :hS , at most r agent-skill-hours can be wasted. 

Let H s (t) be the process of unallocated job-hours for skill s after the allocation at epoch t. Then for all t, 

+ 


H s (t + 1) < 


H a (t) + E - E Ui(t)hi,s + - 


This implies that process G s (t) given by G s (t + 1) = G s (t) + ff j Ui(t)hi jS + r bounds H s (t ). 

G s (t) has dynamics of a queue with arrival process X ft) = JE Aj {t)r^ s + r and departure process Yft) = 
Ui(t)hi tS . Let Aj(9) = E [e^W] and UfO) = E [e ec/i ^] for j G [N] and i G [M], 

For 9 G M, then, 


E 




= E 


jx 3 (t) 


E 




= E \e 6 ^o A f t ) r j'*+ r E |" e - 0 E iUi(t)hi, : 

[ e * A ' (t)r '-] n E[Ui(t)hi ia ] 


= e 


(14) 


= e 


dr 


]~J Aj{9r jtS ) WUi(-9hi tS ) 


exp [or + ^2 log Aj(6r jtS ) + ^ log Uf-Qh^g 


First consider the Gaussian-dominated case. Since the process variance is no more than the mean and the moment 
generating function of the variance is upper-bounded by that of a zero-mean Gaussian: 

\2 


log Aj(0rj )S ) < Xjdr jtS + \j 


( 0r j,s 


log If ( 91 11 s ) A f i j 9h j s -)- fij 


(oh itS y 


2 
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Note that for any two functions k\x 2 and k- 2 ,x, kxx/k\x 2 = oo, and hence for any e £ (0,1) there exists 

x* > 0 such that for all x < x*, k\x 2 /k^x < e. Hence for any e £ (0,1), there exist 9* s ,9* s > 0, for all i,j,s 
such that for all 9 < 9* = min i,j, s (9* s , 9* s , 

log Aj(9r jtS ) < Xj9*r j:S (l + e) (15) 

log Ui(-9h itS ) < -m6hi iS {l - e) (16) 

Note that since N, S, and M are finite and 9* s ,9* s > 0, for all i,j,s,9* > 0. Moreover, note that 9* does 
not depend on A,/i since the ratio of the linear and quadratic terms in the log moment generating functions are 
independent of A and fi. 

As e e — 1 = YlkLi f° r the Poisson-dominated case we have 


log Aj(0r jtS ) < Xj Y ''''I,* 

k 

( _ nu . 

log Ui{ -9hi }S ) < gj Y -^ 


Again, by the same argument, we can have a 
have: 


for which (261 and (27) are satisfied. Thus, for all 9 < 9* we 


E 


J(X s (t)-Y s (t)) 


< exp | 9r + Y, 1 + e ) - Y 1 “ e ) 

\ j i 

< exp [9 f - Y ^ihi,sA - e) • 


(17) 


Note ([28]) follows from the fact A £ (1 — a)C out . As e > 0 can be chosen arbitrarily' small, we can have a — e > 0. 
Since JA> Jl j ( 1 _ a )A r j,s and Yjj A r j,s sca l es with X(N), for sufficiently large A a with Xj > X a for 
all j, we have r — Afi^i,s( a — e) < —7 JA HiKA a ~ e )> f° r some 7 > 0. Thus, we have for some 9 > 0, 


E 


e e(x s (t)-Y s (t)) 


< exp ( —9K ), 


(18) 


where K scales with A. 

Now by Loynes’ construction: 

t 

G s (t) = ma xY( x s(t) - Y s (t)). 

k=r 


Hence, 


Pr (G s >g)< 


E [e eG °] 


o6g 


< e 


- d9 E [f 


< e~ e9 Y ( E 


9 max, E,A 0 P£fi)-Afi)) 


0 e(Xs(t)-Y s (t )) 


T— 1 
OO 


< e ° 9 Y^ ex P (— 9tK ) 


T= 1 


< e 


-9g _ 


(19) 


1 — exp ( —9K)' 
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This in turn implies 


Pr f max G s > c log < SPi{G s > clogJV) 

gg—Oc\og N 

~ 1 — exp {—OK) 

_S _ 1 

— N dc 1 — exp {—OK) 

Note that the total number of unallocated jobs in the system is upper-bounded by (minj iS:r . o >o Yhs^s, 

as any unallocated job has at least rnirij., s:r s>0 rj iS skill-hours unallocated. Hence the total number of unallocated 
tasks in the system, Q, satisfies: 


Pr {Q > cS log N) < Pr ( G s > c min ry s S\ogN ] 

' ^ * ji S '- r j,s> o ’ ' 




< Pr [ max G s > c min r ? - s log N ) 
V S j,S-Tj,s> o ) 


S 


N 1 


0c min 7 


«>° r J> 1 — exp {—OK) 

Note that 0* > 0 does not depend on A, so 1 _ cxp 1 (_g^-) is 0(1)- As S = 0{N), we can choose a c > 0 such 
that 0*cminj jS:r . s >o ry,. s > 3 and hence, Pr {Q > cS log N) < o{N~ 2 ). 


F. Proof of Proposition [72] 

We prove this proposition by constructing a simple (but general) system and show that the system is not stable 
via a domination argument with a Markov chain that is not positive recurrent. 

Consider N = 1, M = 2, and S > 1 being even. Let r = 1, hi = terms, 0,0, •••) and h 2 = 

(0,0, • • • | terms, 1,1, ■ ■ ■). Let the arrival to the system be i.i.d. with 

(1 — a — <5)A, w.p. (1 — e) 

2A, w.p. e, 

such that (l + a)e —5(1 —e) = 0, resulting in mean arrival rate (1 —A) and variance < A. One possible construction 
is to take e = yy and 5 accordingly. The agent-availability process is considered to be U\ = U 2 = A. Note that 
both arrival and agent availability processes arc Gaussian-dominated as well as Poisson-dominated. 

First, consider the case where greedy picking of tasks by the agents may be adversarial. Each agent picks all the 
tasks that it can take from the job and so agents of different types pick from a different half of the S skill-parts. 
Thus if there are > 2A jobs, in worst case (where adversary gives the job to the agent) agents of type 1 may pick 
^ job-parts of A jobs, while agents of type 2 pick parts of other A jobs. Hence no job is actually allocated at that 
allocation epoch. By the next epoch at least (1 — a) A jobs have come and if the agents pick jobs in an adversarial 
manner, again no job is actually allocated. Thus the number of unallocated jobs keep growing after it hits 2A once. 
Note that since there is a positive probability of > 2A arrivals, with strictly positive probability, the number of 
unallocated jobs grows without bound. 

Next, we prove the case where greedy picking of the tasks by the agents is random, i.e., an agent picks all S/2 
parts of a randomly selected job (without replacement). As arrivals and availability are i.i.d., we can describe the 
number of unallocated jobs by a Markov chain Q{t). Note that for Q < A, Pr(Q —> 0) = 1. On the other hand, 
0 < Pr(0 —> Q) < 1 for Q < 2A. 

Consider any Q = n\ for n > 1. Note that Pr(Q —> x) = for x < (n — 1) A as no more than A jobs can be 
scheduled, because there are A agents of each type. Again, 

Pr (Q decreases at least by 1)<1 — (1 — —--- 

V Q + A - aA 

because there are at least (1 — a) A arrivals and each type picks A agents randomly and this is the probability that 
the picked sets have a non-empty intersection. Again, as there are 2A arrivals w.p. > e we have 

Pr (Q increases by at least A) > e (1 — ———- 
V 3 \ Q + 2\ 
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Based on computation of transition probabilities for each transition, it follows that for all k > 1, the probability 
of Q decreasing by at least 1 as well as the probability of Q decreasing by k is decreasing with Q, whereas the 
probability of Q increasing by k increases. 

Hence, we can dominate the above chain by another chain Q on AZ + with transition probabilities 

Pr (An —> A(n + 1)) = e ( 1 --—^ 

V n + 1/ 

Pr (A(n + 1) — > An) = 1 — 

Now the chain Q(t) is a birth-death chain. If it has a finite (summable over states) invariant, then that is unique. 
We first assume that the invariant is tt and then show that it is not summable to prove that it is not positive 
recurrently As this is a birth-death chain the invariant measure must satisfy: 

7r(n + 1) = 7r(n) — 

1 




Since 


e 




—> oo 


as n —> oo for any finite A > 0, this shows that n is not finite. 


G. Proof of Theorem 13 


Consider the different types of unallocated jobs. These are given by {Qj(t) : j G [JV]}. 

Consider the following processes: for each s £ [S], Q s {t) = YT. r >o Qj r j,s which represent the number of 
unserved hours of skills s over all jobs. 

We now construct another process Q s.t. it dominates the process )C S . Q s . So, if we can show upper-bound on 
Q, then the same bound applies for Q s . Hence, in turn we get a bound for { Q-j (f)} (as min{r J)S > 0} = 0(1) 
by the assumption that {rj, s } do not scale with the system size). 

Towards constructing a suitable Q we make the following observation about the dynamics of Q s and {Q-j}- At 
each time t, f Aj\ (t)r hs amount of s skill hour is brought to add to Q s . Also, this queue gets some service 
depending on the available agent hours. 

At time t, ff rn U rn (t)h rn , s s-skill hour of service is brought by the agents. 

For a job to be allocated, all tasks of it must find an allocation. Hence, for a job in type j-job to find an allocation 
it must get r ]yS hours of service from each skill s. Thus at any time t any skill s queue gets a service of at least 


min \ U m h m s - r, 

«e[S] ^ 

L J m 

where r = maxjrj^^}. This is because of the following. For each skill )T) s U rn h rn . s hour is available. Note that 
a job can be allocated if all its tasks find allocations, coverse of which is also true. That is if all tasks of a step 
found allocation then the step can be allocated. As min<, g rgi )T) si U rn h rn , s hours of service is brought by the agents 
for each skill, at least min sg [g] U m h rnyS — r of s-skill hours are served (because a maximum of f can be wasted, 
as no task is of size more than r). 

Also, note that the amount of required service brought to the queue Q s at time t is upper-bounded by 


max ) Aj(t)rj <, 
s&[s)^ 3 ’ 

4 An alternate proof follows from noting that if we take a Lyapunov Q itself, then it has bounded jumps and it is easy to check that after 
certain Q > 0 the drift is strictly positive, and invoke the Foster-Lyapunov (converse) theorem for irreducible chain with bounded absolute 
drift. 
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Consider a process Q s with evolution 


Q s (t + 1) = max(<5 s (t) + max 

— min E U m h 

m,s + r,0). 
sG 


Note that given Q s (to) > Q s (to) at some to, the same holds true for all t > to- This is because for x. a, 6 > 0 and 
x' , a' , b' > 0, with x>x',a>a' and b < b' 

max(x + o — 6,0) > max(a/ + a — b', 0), 

and hence, the monotonicity propagates over time. 

Thus, to bound Q s it is sufficient to bound Q s (t). Note that each of Q s has exactly same evolution, so 
let us consider 

Q := SQ\ 

which bounds Y s Q s . 

From the evolution: 

Q(t + 1) = max(Q(t) + S max Aj(t)r^ s - S min U m h m * + r, 0) 

se[S] z —' 


se[S] • 


we can write the Loynes’ construction for this process which has the same distribution as this process (and for 
simplicity we use the same notation, as we are interested in the distribution). 


Q 1 (0) = max y (S'max y AAt)r.j s — S min y U m h m s + r ), 

r<rn ^' ocrci ^' J v 7 J ’ sefS] z ' ’ 


t<o ' s e[S] ■ 

“ T<t< 0 1 ‘ J 


assuming that the process started at —oo. 

Let us define X s (t) and Y s (t) as follows: X s (t) := JN Aj(t)rj tS and Y s (t) := YmUmhm,s- Then, 


Now, for any 9 > 0 


Now, 


Q 1 (0) = max Y^ S(m&xX s (t) — min Y s (t) + r). 

~r< 0 x. - J c c 


T< 0 ^-' 8 

~ T<t< 0 


Pr (Xl > r ^) - Pr £ > ?) 


< Pr(Qi(0) > q) 

= Pr(0Qi(O) > Oq) 

= Pr(exp(0Qi(O)) > exp {Oq)) 

< E[exp(-0q)]E[exp(#Qi(O))]. 


( 20 ) 


( 21 ) 


E exp(0Qi(O)) = E exp(0S f max Y^ (max X s (t) — mm7 s f + r) I) 

I T< o z — ‘ s s l 

\ ~ r<t< 0 

< y^E[exp(0S y (max2f s (t) — minYiTf) + f))l, 


r<0 


T<t< 0 


( 22 ) 


where the inequality in (221 follows because for any random variables { Z :] }, exp(0Zj) are positive random variables 
and sum of positives are more than their maximum. 

Next, we bound the term within the summation over r < 0 in 


E[exp(#S Y (max2f s (t) — minYs(f) + r))l < TT E[exp(0(max2f s (f) — min Y s (t) + r)))l. (23) 


r<t<0 


T<t< 0 
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Inequality in ( [23] ) follows because X s (t), Y s (t ) are i.i.d. over time. 
Next we bound the term within the product n -<t<o ' n 


E 


3 0S(max s X s (t)— min s Y a (t)-\-r) 


< 


£4 

s,s' 


,0S(X.(t)-Y,,(t)+r) 


Inequality ( |24| ) is due to the same reason as ( |22| ). 

Let Aj(9) = E [e 0j4j ^] and U m {9) = E for j G [ N] and m G [M]. For 0 G M, then, 


E 


J(X a (t)-Y a ,(t)) 


= E 


3 0X.(t) 


E 




= E | ~ e e T, j A i(t) r i,s+ r E [ e -®£ iUi(t)h. 


= e 


dr 


= e 


dr 


\‘ 

JJE |r" ' ’ JJE [e 

3 * 

|^[ Aj(9rj jS ) Y\_Ui(-9hi 


—dUi(t)h ia 


(24) 


(25) 


= exp | 9r + Y log Aj(9r^ s ) + Y log Ui(-9h i>s 

\ 3 i 

Note that as A G aC, by the definition of C°, ^j r j,s < a J2 m Vmhm^ and by assumption j J2 rn l l mhm,,s ~ 
'Yhm l i mhm,s' \ £ subpoly (N) which is used in the following. 

First consider the Gaussian-dominated case. Since the process variance is no more than mean and the moment 
generating function of the variance is upper-bounded by that of a zero-mean Gaussian: 


log Aj(9r jjS ) < A j9r jtS + A j 


( 0r j,iY 

2 

(9hi, s y 


log Ui(—9h itS ) < ~Hj9hi tS + Hj 

Note that for any two functions k\x 2 and k^x, liin :J; ^o k^x/kix 2 = oo, and hence for any e G (0,1) there exists 
x* > 0 such that for all x < x*, k\x 2 /k-)X < e. Hence for any e G (0,1), there exist 9* s ,9* s > 0, for all i,j,s 
such that for all 9 < 9* = min i,j, s (9j s , 9* s , 

log Aj(9r jXs ) < Xj9*r jtS (l + e) (26) 

logUi(-9h ijS ) < ~Hi9hi tS { 1 - e) (27) 

Note that since N, S, and M are finite and 9* s , 9* s > 0, for all i,j,s,9* > 0. Moreover, note that 9* does 
not depend on A,/i since the ratio of the linear and quadratic terms in the log moment generating functions are 
independent of A and fi. 

As e e — 1 = tt> f° r the Poisson-dominated case we have 


log Aj{9r jt s) < A j Y 

k 

log Ui(-9h i)S ) < Hj Y J 


( 9r jtS 


k\ 

( -9hi iS ) h 
k\ 


Again, by the same argument, we can have a 9* for which ([26[» and (27) are satisfied. As I V, luh, V- n,hj ,4 = 
o(N s ), for all 5 > 0, and mh M = n(N c ), c > 0, for all 9 < 9* we have: 


E 




< exp I 9*r + Y ^j0* r j,s0- + e ) - Y - e) + 9*o(m9h iiS ) 

\ 3 * 

< exp 6» f - Y kihiAp- ~ 2e) . 


( 28 ) 
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Note ( |28j ) follows from the fact A € (1 — a)C out . As e > 0 can be chosen arbitrarily small, we can have a — 2e > 0. 
Since n,h hS > V \(1 — a)Xjrj tS and V A jrj t i jS scales with A (N), for sufficiently large X a with A j > X a for 
all j, we have f — A khi, s {& — e) < —Tz2i A Hhi, s (ot — e), for some 7 > 0. Thus, we have for some 6 > 0, 


E 


e es{x s {t)-Y s ,(t)) 


< exp ( —0SK(N )), 


(29) 


where K(N) scales with iV no slower than Yl s -r x >0 = fi(AA c ), c > 0. 

Thus, 


E 


D 0S'(max s X s (t) —min s Y s (i)) 


Hence, from (22 1 , (23), and (24) we have that 


< S 2 exp{-0SK(N)). 


E[exp(gy^Q s (0))] = E[exp(0S'(3 1 (O))] 

S 

= E[exp(0Q(O))] 

< Y 5 ' 2T| exp(-05iT(iV)|r|) 

r<0 

<c', 


because S 2 < exp (6*SK(N)) for all sufficiently large N. 

Note that though we proved E[exp(0Q(i))] < d for t = 0 , this holds for any finite t with exactly the same 
proof. Hence, 

Pr(Q > clogiV) < Pr(Q > clog A) < de~ cd Iog N , 


which gives the result for an appropriate choice of c. 


H. Proof of Theorem 14 


To prove that any A e C L is stabilizable it is sufficient to invoke Thm. [ffi] whose proof is below. To show that 
X° d C 1 is not stabilizable, we take an approach similar to the proof of Thm. [ 5 ] 

Note that the set {A : A = A 1 , A 1 6 Cp} is convex. Also, if A' < A (component-wise) for some A belonging to 

the set, then A' £ C 1 . That is, C 1 is coordinate convex. Hence for any A ^ closure of C L , there exists a hyperplane 
h > 0 that strictly separates it from C L , i.e., for any {A 1 £ C 1 }, for some e > 0 


h T X° > h T Y + e 

1 

Note that at any epoch t, number of j jobs allocated A j(t) = ZN^-(A), where A((f) is the number of j jobs 

allocated to category l agents. Following similar steps as in the proof of Thm. [5j the result follows. 


I. Proof of Theorem 15 


Let Qij iS (t ) be the number of unallocated (j, s) tasks in the /th pool and Aj(t) be the number of arrived jobs 
of type j. Let be the number of jobs that are sent to pool l and Aij = Aj(t). At any pool l Max Weight 

is followed and Dij tS (t) is the number of allocated (J. s) tasks in pool l. 

Consider a Lyapunov function L(Q) = YlijsQljs- Th en: 


E [L (Q(t + 1)) — L (Q(i)) |Q(f)] 



-E 

IQ(*) 

+ E 

Y, |QW 

J,j,s 


f,j,s 
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The last term can be bounded by noting: 


Y \ A h^ + ^h,s) - I Y \ Ai j^ + 

1,3,8 \l,j,s 


Y^ Aj(t) + A,j,s 


3,8 


l,3, s 


< 


Y A 3 (*) + 


3,8 

< B' < oo. 


min(r J)S > 0) 


Y h l u i 


1,1,s 


(31) 


(32) 


Note ( fTi~[ ) follows similarly as ( p~2| ), whereas ( [32] ) follows because the arrival and agent-availability processes have 
bounded second moments. 

To bound the first term: 


E 


Y^A^Aj^) IQ(*) 

1,3,8 


= E 


< E 


Y [Y^At) KiCOlQCO 


A 


Yy A j®Q l J,*® IQ(*) 


1,3,8 

— Y^ (Y] Qi,j,s(t)\ 


(33) 


(34) 


3,8 


where (331 is because of the fact JLTT-MWTA sends all arrivals of type j to the pool l with minimum V.. Qij tS (t). 
On the other hand, 


E 


/* ^ Qi,j,s(t)A-i,j,s{t) |Q(f) 

1,3,8 


E e 




3,8 


(35) 


because at epoch t each pool l runs Max Weight based on only Qi(t) and {Ajj iS : j, s} is independent of 

{A V j(t),Qi'j jS (t) given Q ft). 

For every l, we can compute the difference between the Zth term of ( |34| ) and that of (351, which is similar to the 
first term of ( fT2] ). If A + e E C L , then X 1 + € C l , hence following the same steps as in the proof of Thm. [8j we 

have 


Y Ql,3,s(t) xl j -E 


3,8 


Y^A^YjAt) IQ'(t) 


3,8 


— £ Y. Qi,j,s(t)- 

3,8 


This in turn implies 


E [L (Q(t + 1 ))-L (Q(f)) |Q(i)] QuAQ- 


1,3,8 


Following similar steps as in proof of Thm. [8j we obtain lim sup^^ 22 1 j■ s E [Qij jS (t)] < oo, which in turn im¬ 
plies that for all j, s, l, limsup^^ E [Qij, s {t)] < oo. This proves the theorem since for all j, Qj(t) <22i s Qi,j,s(t)- 


J. Proof of Theorem 16 


It is sufficient to prove that C 1 C C°. Consider any A e C 1 , then by definition of C 1 , A = V, X 1 , where for 
all l, X 1 E C l . By the characterization of the outer region of single category systems C 1 C Cj mt , X 1 E C"" 7 . Thus 
A = 22 1 X 1 for X 1 E C° ut , for all l. Thus by definition of C°, X E C°, which completes the proof. 
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K. Proof of Theorem 1 7 


We find a high-probability bound on the number of unallocated jobs of type j and then bound the maximum 
number of jobs across type. To bound the number of unallocated jobs, we use stochastic domination based on the 
nature of the method of splitting job arrivals across different pools. 

Consider the dynamics of Q*{t) = max; Qi. f t,). Let Ajj (t) be the number of jobs of type j that were directed 
to pool l. In the Improvised JSQ step, jobs are sent one-by-one with minimum backlog and hence, the queue I* 
with Q l - (t) = Q*j(t) gets the minimum number of jobs. Since minimum is less than average, Ajj*(t) < Q n 

the other hand, just before allocation at epoch t + 1, the total number of j-jobs in l* cannot be less than the number 
of jobs in any other l by more than 1. This is because of the Improvised JSQ which allocates jobs one-by-one to 
the lowest backlogged (iVj) queue at that time. Hence, we have 

Q*{t) + + 1 > Qj,i(t) + Ajj(t). 

Given Qij, for GreedyJob the number of jobs that can be allocated (assuming number of queued jobs to be 
infinite) A;j is monotonic in U ; , i.e., if U* > U /Z (component-wise) A; j > A 'lj for all j. This property will be 
useful below. 

Consider the following dynamics Q 3 (t),j £ [N], Arrivals for each j are according to [ Ai j ® ~| + 1 and agent- 
availability is according to U l {t) = min; U\(f). This is a single-category system and allocations in this system are 
according to GreedyJob. As U l (t) < U\(t) for all i,l, the number of allocations (assuming queues to be infinite) 
satisfies A 3 (t) < AThis implies that for each type j Q 3 (t ) dominates Q*(t) as the first queue at any epoch 
has more number of jobs to be allocated and less number of possible allocations (as A 3 is smaller). Thus, it is 
sufficient to bound Q 3 (t). 

For this we proceed along the lines of the proof of Thm. 
agent-availability by U l {t) = min; U\(t). 

Hence, for each skill s, the queue of unallocated skill-hours H s (t ) is the arrival X s (t) = X 1 (t)r J:S and the 
possible amount that can be drained is Y s (t ) = U l (t)hi )S . Hence, 

Q\t + 1) = | Q s (f) + X s (t) - Y s (t ) + f| + . 


13 


replacing arrivals by A 3 (t) = |" 


_ r X (*) ~ 


+ 1 and 


On the other hand, 


We can follow similar steps by noting that for 9 > 0: 

E 

■ e -e^ 

= E 

g min i Ul(t)ht,s 



= E 

g—0min; Ul{t)h itB 



= E 

maxe-^^'^' 

l 



< J]E J e - 0 £.4(t)N. s 


E 




<e 6 ^ 


r +1 > f [jE[e^ 


-Aft) 


Making the assumption on agent arrival rates, 


£4 


- 0 £ 


< e 


log L+o (subpoly (N )) ]g 




(36) 


(37) 


Since max;.;/ — p l -\ = subpoly(A) for all i, A 6 (1 - a)C° implies that f — subpoly(A) £ C out . Thus, we can 

use the same steps as we did for single-category systems. 

For Gaussian-dominated as well as Poisson-dominated cases, following similar steps we can obtain that for a 
9* > 0 (independent of A) and sufficiently large A, 


E 


e 0{X‘-V) 


< e -rMA)+io g L+o(subpoly(AO) 


Note that K(X) increases as Q(min, A;) = Q(N C ) for some c > 0, L = 0(1), hence there exists A sufficiently 
large such that —9*K(X) + logL + O (subpoly(AQ) is strictly negative and E [e e< ~ x ^ <1. The rest follows 
similarly as the proof of Thm. mi 
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L. Proof of Theorem 18 


We first consider FD, FND, and IND systems. 

Let Aj(t) be the number of accepted jobs of type j between starts of epochs t — 1 and t. Let A j jS (t) be the 
number of allocated (j, s ) tasks by the Max Weight part of MWTA before the execution of Task Allocation. Let 
Dj.sit) be the number of allocated (j, s) tasks at allocation epoch t at the end of Task Allocation. Then: 


£ (QUt + i)-QlJt)) = £ ((QjM + MQ-DjAttf-QUt)) 

0 j,s:r jtB > 0 

< £ (iQjAt) + Aj(t) - A jt s(t )) 2 - QjAQ) 

j,s:r^ B > 0 

= E %■(*) (Tw - Aj,.(t)) + E (%(‘)+ 

j,s:r jtB > 0 j,s:r jtB >0 


Expectation of the second summation (conditioned on Q(i)) can be bounded by noting that Aj(t) < Aj(t) and 
the fact that the arrival processes have bounded second moment. The value E [£■ s . r >0 A j s (t) can be bounded 
similarly as in the proof of Thm. [8] Flence, we consider the expectation of the second term to be bounded by B 
independent of Q. 


E 

j,s:r j>B > 0 

< B + E 


< B + E 


£ Qj,a(t) ~ AIQ(t) 

>0 

£ Q j:S (t) ((1 - - A j>8 (t)) \Q(t),A(t) 


j,s:r jtB > 0 

E 


< B + E E 


< B + E E 


< B + E E 


< B + E E 


j,s:rj tB > 0 

A j(t) 


Q (t) 


j,s:rj, a >0 




|Q(f),A(f) 


P(t)J2i Aj(t) sr ^ f „ \ P(t)J2 : Aj(t) 

3 J ((1 - /3(f))E[^(f)] - A 3 -,,(t)) - W |Q(t),A(t) 


m) 

Q (t) 


j,s:rj <a > 0 


3 3 + £ Qj,s(t) ((1 — P(t))Aj(t) — Aj' S (t)j — jW |Q(t),A(t) 


j,s:rj s >0 


IQW 

(38)' 


P*T,j A j(t) 


j,s:rj |S >0 




|Q(f),A(f) 


IQCO 


( 39 ) 


<5 + iE[(r-/3(f))£A J (t)]+E 

j 


£ Qy s (f) ((1 - /T)A,(f) - A iiS (f)) |Q(t) 


j,s:rj iB >0 


< 


S + -E[(r-/3(f))£A j (f)]+E 


£ Q,- S (t) ((1 - r)A,(f) - A,- a (t)) |Q(f) 


j,s:r jtB > 0 


where ((39]) follows as /3(f) minimizes /3£ Aj(f) ~ v PY,j s , r fl >o Qj,s(t) A j(t). 
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As (1 — (3*)\ + el £ C, following the same steps as in the proof of the Thm. [8] 


E 


_j,S- r 3, a>0 

As j3* — /3(f) < 1, we have 


E (E - m) 


— e y > Qj,s{t) 

j,s:r jtB >0 


E 


E («!,,(«+1) - q 


j,s:rj' B >0 


< B H— E] Aj — eE 


ei Qj,s(t ) 

j,s:r jtB > 0 


Following again the same steps we show that liiri sup^^ E Ylj S -r- >o QjA^) < 00 which implies the crowd 


system is stable. Hence, we can assume that E s . r >0 Qj,s[t) 
Thus we can write 


< C for some C < oo. 


E 


which in turn implies 


E {qU t )-q. 


j,s:rj'„> 0 


< 


HT+-^E[(/3*-/3(f))E^'W] 5 


t=i 


T 


j t =1 J j,s:r jtS >0 

Since H is a constant depending on arrival and availability statistics, zv and T can be chosen to be small and large 
respectively to ensure that the left side is arbitrarily small. The desired result follows by noting that 


E 


E(i-«‘))E a j(*> 


t =l 


= E 


T 


EET(z) 

t =i 3 


Also note that since the only requirement is the independence of A(f) and U(f) across time, the proof directly 
extends to settings with non-stationary arrival and availability processes. 

Now consider the ID setting and use the Lyapunov function Yhi j sQ'i j s- Similar to before, the Lyapunov drift 
can be bounded by 

B + E E QijA*) (EiW - \jAt)) IQW 

J,j,s:r jtB >0 

Note that 0 Qij >s (t)Aij(t) is equal to 

E E n E QidAt)\ Aj(t), 

l,j,s:rj >a >0 \ s / 

as Aj(t) = 221 Ai.jU) and JLTT ensures that the jobs are sent to the category with min/ (22s Qi,j, s (t)). So we have 


E 


E QuA*) - AijA*)) IQ(*) 


l,j,s:r jiB > 0 


= E 


< E 


E m , in ( ) Aj(t) - E IQW 


l,j-rj,s> 0 


l,j,s:rj lB > 0 


A( 


El Qi,j,s(t ), Aj(t) El 


j,s:r jtB > 0 


where (1 — (3*)\ = ^(1 — (3*)\ l for A ; + el £ CJ. The remainder of the proof is similar to the approach for the 
FD/FND/IND settings, i.e., the MWTA proof for (1 — 13*) A + el £ C 1 for each l. 

























